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AMENDMENTS TO THE CLAIMS: 

This listing of claims will replace all prior versions, 
and listings, of claims in the application: 

LISTING OF CLAIMS : 

1-21 . (canceled) 

22. (new) A method for gene mapping chromosome and 
phenotype data from a database, comprising analyzing linkage 
disequilibrium between genetic marks lUi, 

i) searching from the data for all marker patterns P that satisfy 
a pattern evaluation function e (P) , wherein 

a: the marker patterns are expressions within the 
database comprising genetic markers and their alleles and zero or 
more of the following: individual covariates, environmental 
variables and auxiliary phenotypes; and 

b: the pattern evaluation e(P) is a measure of the 
association between the marker pattern P and a phenotype being 
studied, 

ii) scoring each marker lUi of the data with a marker score s (mi) , 
which is a function of the set Si defined as the set of marker 
patterns overlapping the marker mi and satisfying the pattern 
evaluation function e as defined in step i), and 
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iii) mapping the location of a gene by evaluating the scores 
s (m±) of ail the markers lUi in the data which is determined by 
maximizing the score if the scoring function is designed to give 
higher scores closer to the gene, and on minimizing the score if 
the scoring function is designed to give lower scores closer to 
the gene. 



23. (new) The method of claim 22, wherein the 
chromosome data consists of either haplotypes or genotypes. 

24. (new) The method of claim 23, wherein said 
haplotypes and genotypes contain flexible regions. 



25. (new) The method of claim 22, wherein the marker 
patterns P are searched by the following algorithm: 
Input 

• set of U marker patterns 

• evaluation function e(P) for patterns P In U 

• (generalization) relation < for patterns in U 

• where the function e and the relation < are such that if e(P) 
is true and < P, then e(P') is also true 



Output 
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set S {P z Ul e (P) is true} of patterns 
Method 
S : = {} 

// Initialize the set of evaluated patterns: 
E : = {} 

// Start with the most general patterns: 

Gen := {P in U} \ there is no P' in P' ! = P, such that P' < P} 
// Recursively evaluate patterns in a depth first order: 
foreach P e Gen { evaluatePatterns ( P) } 
end; 

procedure evaluatePatterns ( P) { 

insert P into the set E 

if e(P) = true then { 
insert P into set S 

// Find all specializations of P that have not been tested yet, 
and 

// evaluate them recursively; 

Spec := {P' in U-E \ P < P' , P' \^ P, and there is no P' ' in U-E, 
P' ' \=P 

and P"!= P' , with P < P' ' < P' } ; 
foreach P' in Spec { evaluatePatterns ( P' ) ; } 
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26. (new) A method of claim 22, wherein the marker 
patterns P are searched by the following algorithm: 

Input 

• set (7 of marker patterns 

• evaluation function e(P) for patterns P In U 

• frequency threshold x 

Output 

• set S ^ {P in U \ e(P) and ae(P) is true} of patterns, where 
ae(P) is true if and only if the frequency of pattern P exceeds a 
given threshold x 

Method 

S : = {} 

// Initialize the set of evaluated patterns: 
E : ^ {} 

// Start with the most general patterns: 

Gen : = {P in a I there is no P' in U, P' ! = P, such that P -> P' } 
//Recursively evaluate patterns in a depth first order: 
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foreach P in Gen { evaluatePatterns ( P) } 
end 

procedure evaluatePatterns { P) { 
insert P into the set E 
if ae(P) = true then { 
if e(P) = true then insert P into set S 

// Find all specializations of P that have not been tested 
yet, and evaluate 

// then recursively: 

Spec := {P' in U~E \ P' -> P, P'!= P, and there is no P' ' in 
U-E, P''\=P 

and P''[=P', with P' -> P' ' and P' ' -> P} 
foreach P' in Spec { evaluatePatterns { P' ) } 

27. (new) The method of claim 22, wherein the marker 
patterns P are searched by the following algorithm: 

Input 

• marker map M = (mx^ ... , luj^) 

• phenotype vector Y = (Yi, Yn) 

• haplotype matrix H of size n k 
association threshold x for chi-squared test 
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maximum pattern length 1 

• maximum number of gaps g 

• maximum gap size s 

Output 

• set S = { P in U \ e(P) is true] of patterns, 

• where U consists of patterns on M that consist of marker- 
allele assignments and that adhere to parameters i, and i, and 

• where e(P) is true if and only if chi-squared test on P using 
haplotype matrix H and phenotypes Y exceeds the given threshold x 

Method 

S : - { } 

// Number of case and control chromosomes: 

piA := number of disease-associated chromosomes; 

pic := number of control chromosomes; 

pi ;= piA + pic 

//A lower bound for pattern frequency: 
lb : piA '''pi X / (pic pi + pIa 

// Variable for iterating over different patterns: 

P = (Pir ... r Pk) :-('^'. 
For i := 1 to k { 
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// alleles ("jUi; is the set of alleles of the i : th marker 
f oreach a in alleles {lUx) { 
Pi a 

// Test pattern P and all its extensions: 
checkPatterns { O) 
// Reset Pi'. 
Pi 
} 
} 

end 

// Test haplotype pattern P and all pattens that can be generated 

by extending P 

// from the right: 

procedure checkPatterns ( starts nr-of-gapsr gap-length) { 
// Output strongly associated patterns 

if chi-squared ( P, M, H, Y) >= x and Pi ! = ' ' then insert P into 
set S 

// Return if extended patterns would be too long: 
if i = k or i-hl-start > 1 then return 

// Return if extended patterns cannot be strongly disease- 
associated : 

if frequency of P in disease-associated chromosomes is less than 

lb 
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then return; 

// Create and test legal extensions of current pattern P (3 
cases ) : 

// 1. Give a marker i+1 all possible values: 
f oreach a in alleles (m±^i) { 

checkPatterns (P, starts 1-^1^ nr-of-gaps^ O) 
} 

// 2. Introduce a new gap starting at marker i+l: 
if Pi and nr^of^gaps < g and s > I then { 

Pi+1 : ^ 

checkPatterns {P, start, nr^of^gaps+ 1, 1) 

} 

// 3. Extend the current gap over marker i+l: 
if Pi = and gap-length < s then { 

Pi + 1 := 

checkPatterns (P, start, nr^of-gaps, gap-length+1) 

} 

// Before returning, reset pi-f-1: 

Pi+1 := ' ^' 

return 

} 
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28. (new) The method of claim 22, wherein the marker 
patterns P are searched by the following algorithm: 

Input 

• set U of marker patterns 

• evaluation function e (P) for patterns P in U 

• (generalization) relation < for patterns in U, where the 
function e and the relation < are such that if e(P) is true 
and P' < P, then e(P^) is also true 

Output 

• set 5 = {P in [/ I e (P) is true] of patterns 
Definitions 

• function Lgg : U -> 2^, Lgg(P) ^ { P' in U \ P > P' and P' ! = 
P and there is no P' ' in U such that p i-= p' ' \= p' and P > 
pf / ^ P' ] , the set of least general generalizations of pattern 
P 

• function Lss: U -> 2^, Lss (P) = { P' in U I P < P' and P' ! = 

P and there is no P' ' in U such that P 1= P' ' != P' and P < 
pf f < p' ] ^ the set of least special specializations of pattern 
P 

Method 
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S : = {} 
Q : = {} 

// Start with the most general patterns: 

F : = {P in U \ there is no P' in U , P' != P, such that P' < P} ; 
while F ! = { } { 

// Evaluate the candidate patterns: 
foreach P in F { 

if e(P) = true then insert P into set S 
else remove P from set F 

} 

Q : = Q union F 

// Generate a new set of candidate patterns: 
C : = {} 

foreach P in F { 

C : - C union { P' in C7 | P' in Lss (P) and for all 
P' ' in Lgg (P' ) : 

P' ' in Q } 

} 

F := C 

} 

end 
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29. (new) The method of claim 22, wherein the marker 
patterns P are searched by the following algorithm: 

Input 

• set U of marker patterns 

• evaluation function e(P) for patterns P xn U 

• frequency threshold x 

Output 

• set 5 = {P in y I e(P) and ae(P) is true} of patterns, where 
ae(P) is true if and only if the frequency of pattern P 
exceeds a given threshold x 

Definitions 

• function Lgg: U -> 2^, Lgg(P) = { P' xn U \ P -> P' and P' ! = 
P and there is no P' ' In U such that P \== P' ' != P' and P -> 
P' ' -> P' } f the set of least general generalizations of 
pattern P 

• function Lss: U -> 2", Lss (P) = / P' in y | P' -> P and P' ! = 
P and there is no P' ' in U such that P ! = P' ' != P' and P' -> 
P' ' -> P} , the set of least special specializations of pattern 
P 
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Method 
S : = {} 
Q : = {} 

// Start with the most general patterns: 

F := {P in U \ there is no P' in U, P' != P, such that P -> P' }; 
while F ! = { } { 

// Evaluate the candidate patterns: 
foreach P in F { 

if ae(P) = true then 

if e(P) = true then insert P into set S 

) 

else remove P from set F 

} 

Q : = Q union F 
// Generate a new set of candidate patterns: 
C : = { } 

foreach P in F { 

C : = C union { P' in U \ P' in Lss (P) and for all 
P' ' in Lgg(P' ) : 

P' ' in Q } 

} 

F : = C 

} 
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end 



30. (new) The method of claim 22, wherein 

a) the phenotype being studied is qualitative, and 

b) the pattern evaluation function e(P) has the form e(P) = 
true if and only if e'(P) > x, where e ' fP) is the (signed) 
association measure x is a user specified minimum 
value, and 

c) the score s (mj_) of marker m-i is the size of Si, also 
called marker-wise pattern frequency of m± and denoted by 
f(ini) . 



31. (new) The method of claim 22, wherein 

a) the pattern evaluation function e (P) has the form 
e(P) = true if and only if e'(P) > x, where e*(P) is the 
absolute frequency of pattern P in the data and x is a user- 
specified value, 

b) in order to derive the score s (m±) , the p value 
(statistical significance) of each marker pattern P in 
determining the phenotype being studied is evaluated, and 

c) the score s (nij) is the distance between the 
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observed p value distribution of patterns in Si and the 
uniform distribution, defined as average of (pi - q±) log (p^ 
/ Qi) over all i = l..n, where n is the number of haplotype 
patterns in S^^ Pi is the ith smallest p value in S±, and q± is 
the expectation of the ith smallest p value , if the p values 
were randomly drawn from the uniform distribution. 

32. (new) The method of claim 31, where the p 
value is computed using a linear model of form Y = j3iXi + ... + fi^Xj^ -h 
aZ -h /3o, where the dependent variable Y is the phenotype being 
studied, Xi through Xj^ are covariates, and Z is a dummy variable 
for the occurrence of the haplotype pattern, and 

the coefficients a and J3* are adjusted for best fit, and then 
the significance of Z as a covariate is assessed using a t test 
with the null hypothesis "or = 0". 

33. (new) The method of claim 22, further refining 
each score s (m-i) by replacing it by the marker-wise p value of the 

score s (m-i) , where the statistical significance of s (lUj^) is 

measured against the null hypotheses that there is no gene effect. 

34. (new) The method of claim 22, wherein an area 
returned from a prediction of a gene location is contiguous or 
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fragmented or a point. 

35. (new) The method of claim 22, wherein the 
location of the gene, predicted as a function of the scores s (inj_) 

and based on maximizing or minimizing the score, is predicted by 
the combination of most probable intervals for containing a trait- 
susceptibility locus that covers a proportion t ^'te { 0, 100% } ) of 
the original region obtained by taking all such points in the 
studied chromosomal region whose nearest marker is within the k 
best scoring markers, and wherein k is selected so that the 
resulting area has length at most t times the length of the 
studied region. 

36. (new) The method of claim 22, wherein the 
location of the gene, predicted as a function of the scores s (mj_) 

and based on maximizing or minimizing the score, is predicted to 
those points in the studied chromosomal region whose nearest 
marker scores at least y or at most y, where y is scoring function 
dependent and is selected so that the probability of the gene 
being close to the marker is determined. 

37. (new) The method of claim 22, wherein the 
location of the gene, predicted as a function of the scores s (inj_) 

and based on maximizing or minimizing the score, is determined by 
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evaluating the marker scores or by visualization. 

38. (new) The method of claim 22, further 
comprising searching for multiple genes by evaluating marker 
patterns that refer to several potential gene loci at the same 
time . 



39. (new) A computer-readable data storage medium 
having computer-executable program code stored thereon operative 
to perform the method of claim 22 when executed on a computer. 



40. (new) A computer system having executable 
program code that performs the method of claim 22 . 



